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Bayesian approach for g-mode detection, or how to restrict our imagi- 
nation 
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Nowadays, g-mode detection is based upon a priori theoretical knowledge. By doing so, detection becomes more restricted 
to what we can imagine. De facto, the universe of possibilities is made narrower. Such an approach is pertinent for Bayesian 
statisticians. Examples of how Bayesian inferences can be applied to spectral analysis and helioseismic power spectra are 
given. Our intention is not to give the full statistical framework (much too ambitious) but to provide an appetizer for going 
further in the direction of a proper Bayesian inference, especially for detecting gravity modes. 
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1 Introduction 

Since the beginning of helioseismology, the detection of 
g modes has been the most challengin g quest in our field. 
There were claims of g-mode detection dDelache & Scherrerj 



tg- 



1983; Thomson et al . 1995), none of which were confirmed. 
Since the conception of the SOHO mission, one of the goals 
of this mission was to detect g modes. In 1997, follow- 
ing the lack of g-mode detection by SOHO experimenters, 
the Phoebus group was formed, with the aim of detecting 
g modes. The group set an up per limit to the g-mode am - 
plitude of 10 mm/s at 200 pHz dAppourchaux et al . 2000b). 
Since then, this lower limit has been even decreased down 
to about 4.5 mm.s -1 for a 10-year observat ion with a sin- 
glet, a nd down to 1 . 5 mm. s ~~ 1 for a multiplet (lElsworth et al. 
2006). 



Over the years, the Phoebus group has developed several 
techniques for g-mode detection. Some of which are based 
upon having no knowledge of the structure and dynamics 
of the Sun. There are also other techniques using a priori 
knowledg e of the Sun such as using rotational splitting pat- 
terns (lElsworth et al.ll2.Q06l and references therein). Recent 
g-mode detection claims rely upon a new detection tech- 
nique derived from the asymptotic prop erties of g-mode pe - 
riods derived from theoretical models (Gar cia et al.l l2007). 
Although these approaches look promising, they are all ba- 
sed on what we believe we know about the Sun. From that 
point of view, it is time that we turn to an approach that has 
been talked about a lot: a Bayesian approach to statistical 
inference. 

First, I will present a short introduction to what I believe 
I understand about a Bayesian approach versus a frequentist 
approach. In the second section, I will show simple exam- 



ples of this can be applied to Fourier analysis. In the third 
section, the Bayesian approach is applied to well known ex- 
ample in helioseismology. I then conclude and give a tenta- 
tive roadmap for the future. 



2 Bayesians versus Frequentists 

The controversy between Bayesians and Frequentists is re- 
lated to subjective versus objective probabilities. A frequen- 
tist thinks that the laws of physics are deterministic, while a 
Bayesian ascribes a belief that the laws of physics are true 
or operational. The subjective ap proach to probability was 
first coined by |Pe Finetti dl937l) . the reading of which is 
extremely enlightening. For the rest of us, the difference in 
views can be summarized by this quote from the Wikipedia 
encyclopedia: Whereas a frequentist and a Bayesian might 
both assign probability ~ to the event of getting a head when 
a coin is tossed, only a Bayesian might assign probability 
— ^ to personal belief in the proposition that there was life 
on Mars a billion years ago, without intending to assert any- 
thing about any relative frequency. In short frequentists as- 
sign probability to measurable events that can be infinitely 
measured, while Bayesians assign probability to events that 
cannot be measured, like the outcome of sport-related bets 
for instance. The Bayesian approach is then related to what 
Reverend Bayes would have understood as the degree of be- 
lief. The application of Bayes' theorem is then referred to as 
Bayesian inference. 



2.1 Bayes' theorem 
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The theorem of Bayes dl763h is known to any kid going to 
high school. It relates the probability of an event A given 
the occurrence of an event B to the probability of the event 
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B given the occurence of the event A, and the probability of 
occurrence of the event A and B. 

P(B|A)P(A) 



P(A|B) = 



P(B) 



(1) 



For example, the probability of having rain given the pres- 
ence of clouds is related to the probability of having clouds 
given the presence of rain by Eq. (1). The term prior prob- 
ability is given to P(A) (probability of having rain in gen- 
eral). The term likelihood is given to P(B|A) (probability 
of having clouds given the presence of rain) . The term pos- 
terior probability is given to P(A|B) (probability of having 
rain given the presence of clouds). The term normalization 
constant is given to P(B) (probability of having clouds in 
general). 

This theorem can then be transposed to anything that we 
know, or any information known a priori: 

P(fi|I)P(D|0,I) 



p(n|D,i) 



(2) 



P(D|I) 

where are the observables for which we seek the posterior 
probability, D is the observed data set, and I is the informa- 
tion. The prior probability of the observables is given by 
P(f2|I): this is the way to quantify our belief about what we 
seek. 

2.2 Nuisance parameters 

In theory, it looks quite simple but in practice the derivation 
of Eq (2) could be somewhat complicated. This is especially 
difficult when there are a subset of observables (tt b ) that 
are not known and need to be eliminated. In that case, it 
is required to integrate (or marginalize) over the unwanted 
parameters as follows: 



p(n a \D,i)= [ p(n a ,n b \D,i)<m b 

Replacing Eq. (2) in Eq. (3), we get: 
with the prior probability expressed using the product rule 



(3) 



dfl b (4) 



p(fi a ,n 6 |i) = p(n 6 |n a ,i)P(n a |i) 



(5) 



Equation (4) can then be integrated provided that the prior 
probability is assigned. 

2.3 Role of the prior 

If the parameters fi Q and fl b are supposed not to be cor- 
related, the prior probability P(fl a ,il b \T) (Eq. 5) can then 
simply be expressed as: 

p(n a ,n b \i) =p(fi |i)p(fi b |i) (6) 

The prior probability will then express what we believe we 
know (or not) about the parameters. The most obvious prior 
probability is the one that is uniformly distributed over some 
range of the parameters fl b of interest. The choice of the 



prior is related to the amount of information at our disposal. 
The role of the prior and its impact on the p osterior proba- 
bility should ideally be as small as possible. ISivia & David 
( 1 19941) show the impact of various priors on the outcome 
of a Bayesian analysis. The ideas developed in this article 
are also used by Thierry Toutain (private communication) 
for inferring hi gh frequency p-mode splitt ing hampered by 
mode blending dAppourchaux et al.ll2000al) . 

This discussion of the role of the prior is beyond the 
scope of this article. Here I am just touching the tip of the 
iceberg. I encourage the reader to seek other sources of in- 



formation ( Javnesl 



1987). 



2.4 Parameters estimation and error bars 

As soon as the posterior probability is known, we can derive 
an estimate of the parameter using: 

<n a >= J n a p(n a \D,i)dn a (7) 

with the following rms error: 

<?n a = j{Sl a - < ft a >) 2 P(n a |D,I)dft a (8) 

This latter expression is used in the next section for the case 
of spectral analysis. 

3 Spectral analysis revisited 



Bretthorstl dl988l) . using a Bayesian approach to the analy- 
sis of the time series of a pure sine wave embedded in noise 
having a gaussian distribution, demonstrated that the poste- 
rior probability for the angular frequency u of the sine wave 
can be written as: 

P(w|D,o-,I) oc e 2 ^ 1 (9) 

where D are the data (di taken at time ti), a is the rms value 
of the noise assumed to be known, and C(ui) is the so-called 
Schuster periodogram given by: 



cm = — 



N 

!E< 

1=1 



(10) 



This periodogram that is today c alled the Discre te Fourier 
Transform was fi rst derived by ISchusterl d 1 897b . Follow- 
ing this approach, |javnesldl987D demonstrated using Eq. ([8]) 
that for a pure sine wave of amplitude A, the rms error on 
the frequency v = lo/2-k is given by: 

r V6 a 1 

W = 7=7^ (11) 

7T aVnt 

where T is the observing time and N is the numbe r of sam- 
ples ta ken. This formul a is the sam e as given by ICuypers 
dl987h and derived bv lKoenl dl999h . This equation shows 
that the Rayleigh criterion (1/T) for frequency resolution is 
very pessimistic compared to the precision with which fre- 
quencies can be measured. This feature provided by pure 
sine wave will be fully used by the CoRoT Data Analysis 
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Team for classical stellar pulsators as outlined by AppourchauxHeatfeJ assumed, unlike Nigam et al. (1998), that the mode 
(2006). 



This revisitation of spectral analysis by Bretthorst is ex- 
tremely useful when one wants to understand how to apply 
a Bayesian analysis to helioseismology. 

4 Bayesian inference for astero- and 
helioseismology 

Bayesi an inference has rec ently been used in asteroseismol 



has no asymmetry. Replacing Eqs. (13) and (14) in Eq. (12), 
the posterior probability can then be expressed as: 



JV 



P(fi s |D,I)cxP(n s |I)J] 



1 



(16) 



If I assume that we know the mode amplitude A, the mode 
linewidth T, the noise B, I obtain after marginalizatiorO] 
over these latter variables the following: 



N 



ogy by Brewer et al. 
2007; ICarrier et all 



d2007h and applied to several stars feedding et M|ol D > : ) « p { v o 1 1) ]J 



2007). Unfortunately, the assumptions 



(17) 



about the stochastic nature of the mode excitation are com- 
pletely ignored in the formulation of the Bayesian inference, 
even though simulated spectra do integrate the randomness 
of the excitation. 

The stoc hastic nature of the mode excitation has been 
known since ( Woodard|l984 ). It leads to the formulation by 
Duvall & Harvey ( 19861) of the fitting of the p-mode spec- 
trum by Maximum Likelihood Estimation. In our case, I use 
this formulation in a very similar manner but instead of ex- 
pressing the posterior probability of, say, the frequency of a 
mode, as a function of the samples taken in time, I will use 
the samples taken in frequency. This is because the stochas- 
tic nature of the excitation of the harmonic oscillator, rep- 
resenting an eigenmode, prevents us from separating the in- 
strumental noise and the solar noise. This separation, as we 
will see later on, is only possible in the Fourier spectrum. 

We apply Eq. (2) to the simple example of a single mode 
stochastically excited. In our case, I can then write: 

p(o 5 |i)p(D|n s ,i) 



p(n.|D,i) = 



(12) 



P(D|I) 

with H s = (l>o, r, A, B), where vq is the mode frequency, 
r is the mode linewidth, A is the mode amplitude and B is 
the noise. The prior probability is given by: 

P(n s \I) = P(v \l)P(T\T)P(A\I)P(B\T) (13) 
where here I assume that the information about vq has no 
relation to the information we have about V, A and B, and 
also for all other pairs of parameters. This is not quite cor- 
rect for A and V but this is the choice of the prior that I 
made here. With the assumption of stochastic excitation of 
the modes, it is known that the power spectrum of the eigen- 
mode is a x 2 with 2 d.o.f wit h a mean given essentia lly by 
the mode profile plus noise iDuvall & Harvey dl986h . This 
assumption is sufficient for deriving the likelihood as: 



/v 



p(D|n„i) = U 



i 



(14) 



where N is the number of samples used in the power spec- 
trum, vi is the frequency at sample i, Sj is the observed 
power spectrum at frequency z/,-, and S(v) is the mean pow- 
er spectrum. In the case of single mode with no correlation 
with the solar background, I can write: 

S(v) = — + B (15) 



1 



2(i/-^o) 

r 



This equation is almost the same as the likelihood used for 
fitting helioseismic power spectra by maximization. 

4.1 Applications to LOI and GOLF data 

For testing the Bayesian inference, I used almost 10 years of 
coeval SOHO data from the LO:0 and GOLF^ instruments 
for testing the Bayes inference. I applied Eq. (17) to the 
GOLF data for which we know modes have been detected, 
and to the LOI data for which modes have not been detected 
below 2000 /iHz. It is well known that GOLF using solar 
radial velocities can detect more easily modes below 2000 
pHz. So the Bayesian approach to the LOI data is here ideal 
because we know that modes exist in this frequency range, 
but they are almost impossible to detect directly. In addition, 
solar models predict that low frequency modes are rather 
insensitive to surface effects. Therefore the theoretical un- 
certainty provided by the model of the atmosphere is here 
alleviated, and the frequency of the mode is only bounded 
by the uncertainties in the model of the internal structure of 
the Sun. 

Figures 1 to 3 show results for three typical cases. Fig- 
ure 1 is for a case where modes are easily detected in both 
instruments. The error bars are typical of what you can also 
get using MLE estimators. Figure 2 shows a very interest- 
ing case where I can detect with the LOI instrument a mode 
at a very low frequency close to 1500 /iHz. In that case the 
error bar on the frequency is quite large. Figure 3 shows the 
detection of a mode below 1000 mHz with the GOLF in- 
strument that was also detected bv lChaplin et alj d2002l) . On 
the other hand, there is no detection in the LOI because the 
posterior probability is now commensurate with prior prob- 
ability that stated that the mode frequency was uniformly 
distributed over that specific range of 2 /iHz. In that latter 
case, the rms error of the frequency is compatible with a 
frequency uniformly distributed over 2 pHz; the rms in that 
case being close to 2/y/l2. 

Although the results for low frequency low degree p 
modes demonstrated that the Bayesian approach could be 
useful, the application in the g-mode range lead to results 
for GOLF similar to those of the LOI for the p modes (See 



We assumed that the prior probabilities are Dirac 8 distributions 
Luminosity Oscillations Imager, Appourcha ux et all i 19971) 
Global Oscillations at Low Frequency, Gabriel et al. 1 1997) 
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Fig. 1 (Top) Power spectra smoothed over 1/iHz (315 bins) of the GOLF and LOI spectra for an I = mode. (Bottom) 
Posterior probability for GOLF and LOI data assuming an amplitude of 1 and a noise of 1, and a linewidth of 1 fiHz. The 
mean and rms error of the frequency as derived from Eqs. (7) and (8) are indicated on the top of the diagrams. 




Fig. 2 (Top) Power spectra smoothed over 0.2 /xHz (63 bins) of the GOLF and LOI spectra for an I = mode. (Bottom) 
Posterior probability for GOLF and LOI data assuming an amplitude of 1 a noise of 1, and a linewidth of so 0.2 pHz. The 
mean and rms error of the frequency as derived from Eqs. (7) and (8) are indicated on the top of the diagrams. 



GOLF 972.400 LOI 972.400 




-1.0 -0.5 0.0 0.5 1.0 -1.0 -0.5 0.0 0.5 1.0 

central frequency (in /iHz) central frequency (in /iHz) 



Fig. 3 (Top) Power spectra smoothed over 0.01 ^Hz (13 bins) of the GOLF and LOI spectra for an I = mode. (Bottom) 
Posterior probability for GOLF and LOI data assuming an amplitude of 5 and 0.2, respectively; a noise of 1 , and a linewidth 
0.01 [iHz. The mean and rms error of the frequency as derived from Eqs. (7) and (8) are indicated on the top of the diagrams. 
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Figure 3). It is quite clear that additional information needs 
to be included before concluding for the g-mode range. The 
Bayesian approach would ideally be suited to the rec ent 
claims of g-mode detection made by lGarcfa et al.1 (120071) . 



5 Conclusion 

The Bayesian approach to the analysis of solar power spec- 
tra seems extremely promising. One must not forget that 
many aspects have been neglected such as marginalization. 
On this latter aspect, either one is fond of integration and 
derives the integrals of Eq. (4), or one uses the so-called 
Markov Chain M onte Carlo (MCMC) algorithm as used by 
Gregory! (|2005). So far, I have not delved into the subject 
but I sense that the MCMC algorithms will need to be un- 
derstood before we can proceed to the proper application of 
the Bayesian approach to g-mode detection. 

Last but not least, this paper could be very far from 
a proper and accurate description of Bayesian inference. 
Here I wanted to attract the helioseismic reader to a field 
that has recently been fast developing in astrophysics. I hope 
this paper will trigger the interest of the reader for a genuine 
treatment of the Bayesian inference. 
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